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Template-based domain-specific reconfigurable logic 



18.12.2003 



The invention relates to a method for creating an architecture of a 
reconfigurable logic core on an integrated circuit, the architecture comprising logic 
components, routing components and interface components. The invention also relates to a 
nfigurable logic core having an architecture created by such a method. 



reco 



The ever continuing scaling of semiconductor technology has enabled ultra- 
scale integration. Therefore, a large number of today's IC's for consumer applications are 
implemented according to the system-on-chip concept In a system-on-chip (SoC), system 
10 components (such as programmable cores, memories, coprocessors, peripherals) are 

integrated on the same piece of silicon. The on-chip integration improves performance of the 

system and reduces its cost 

Traditionally, the SoC components are implemented either as dedicated 
(hardwired) cores or as programmable (general-purpose or DSP) cores. The dedicated cores 

1 5 are characterized by high performance and the functionality is typically restricted to one 
specific function, whereas programmable cores are characterized by a relatively low 
performance and functionality which may be changed arbitrarily. Because of the dramatically 
growing IC mask set costs, the increasing importance of the cost versus performance aspect 
in emerging applications, and the competitive character of the consumer electronic market, 

20 designing SoCs using only dedicated and programmable cores does not provide a fully viable 

solution anymore. 

For these reasons, reconfigurable logic is seen today as an attractive 
alternative to the dedicated and programmable cores. Firstly, reconfigurable logic allows for 
changes in device functionality after such a device is fabricated. Secondly, it offers a better- 
25 balanced trade-off between performance and cost than programmable processors do. 

Consequently, embedding reconfigurable logic in SoCs helps to reduce the number of costly 
redesigns of IC's and extends the lifetime of the final product 

A typical example of a reconfigurable logic device is an FPGA (Field 
Programmable Gate Array). An FPGA is an array of computing elements which are 
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2 18.12.2003 
programmable to execute basic logic and arithmetic functions on the level of bits. The 
computing elements are surrounded by an interconnect network which is also programmable. 
The interconnect network enables communication between the computing elements. 
Programmable input/output elements which are placed at the outer edges of the array act as 
an interface with other system resources. 

The programmable character of reconfigurable logic devices, though 
beneficial on the one hand because of their large application space, is also a reason for their 
area, performance, and power consumption overhead compared to dedicated-logic-based 
devices (ASICs). The overhead is caused by a large number of switches, configuration 
memory cells and interconnect wires which are present in such devices. Hence, the number of 
switches, configuration memory cells and interconnect wires must be balanced against the 
need for such components. 

Because of various application areas and thus various system requirements, 
embedded FPGA (ePPGA) cores, which are fitted for integration on an SoC, must be 
available in different sizes and shapes. This is in contrast to stand-alone FPGAs that are 
usually produced in several predefined sizes and target the implementation of complete 
systems. Next to different sizes and shapes, eFPGA cores must also be cost-efficient in terms 
of area, performance and power, and they must be realizable in a relatively short time. These 
aspects are essential for designing high-quality SoCs for cost-sensitive consumer 
applications. The general-purpose architectures of today's reconfigurable logic cores are not 
fitted to meet these requirements. 



It is an object of the invention to provide a method for creating an architecture 
25 of a reconfigurable logic core, which architecture can be deployed for various purposes, and 
the implementation of which is cost-efficient in terms of area, performance and power. This 
object is achieved by providing a melhod, characterized by the characterizing portion of 
claim 1. 

The invention relies on the perception that a template can be used to describe 
30 such an architecture. The architecture can then easily be created as an instance of the 

template. The template is a model which defines logic components, routing components and 
interface components of a reconfigurable logic core. For example, logic components may be 
logic elements, processing elements, logic blocks, logic tiles and arrays in a hierarchical 
order. Routing components may comprise routing channels comprising routing tracks which 
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provide interconnection means between the logic components. Interface components may be 
input and output ports. The model is configured by a number of parameters; the value of 
these parameters is in accordance with an application domain. 

For example, an application domain may comprise data-path oriented 
functionality, random-logic oriented functionality or memory-oriented functionality. Each 
application domain requires a certain architecture of the components. E.g. a data-path 
oriented logic element must have an architecture comprising a certain number of primary 
input ports, secondary input ports, a carry input port, at least one arithmetic output port, a 
Boolean output port and a carry output port The number of foese input and output ports are 
parameters of the template. By choosing appropriate values for all parameters of the 
template, the architecture which is generated by the template can be fine-tuned for a specific 
application domain. In that case, the overhead which is caused by e.g. a large number of 
switches and interconnect wires in a reconfigurable logic core can be reduced significantly, 
while the reconfigurable logic core is still flexible enough to perform a plurality of functions 

within the specific application domain. 

The concept according to foe invention is referred to as template-based 
domain-specific reconfigurable logic. The main features of this concept are: 

a reconfigurable logic architecture which is application-domain-specific rather 

than general-purpose; 

a generic template of a reconfigurable logic architecture from which domain- 
specific instances can be derived; 

a modular design concept, in particular a modular architecture allowing 
creation of variable-size reconfigurable logic cores using a minimal number of different types 
of tiles. 

In order to guarantee a large application area, traditional FPGAs (and 
eFPGAs) are made general-purpose, which increases their cost overhead. However, SoCs 
typically target a specific application domain rather than all possible application domains. 
Because applications belonging to an application domain or a class of applications share 
similar characteristics and functions, it is thus possible to optimize a reconfigurable logic 
architecture for such a domain. La this manner a significant reduction of foe cost overhead 
can be achieved. The template according to foe invention has the following other advantages. 

The template enables a fast and flexible creation of domain-specific 
reconfigurable logic cores such as embedded FPGAs. 
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By using a generic architecture model and allowing an arbitrary change of its 
parameters, many various architecture instances can be created. This enables a systematic 
architecture space exploration with experiments on a much larger set of potentially 
interesting solutions than would be possible to generate using conventional (manual) 
methods. 

The complexity of a VLSI implementation process concerning a large set of 
different reconfigurable logic cores (template instances) can be considerably reduced if the 
specification of their architectures, in the form of a netiist or a layout, for example, can be 
generated automatically from the generic architecture template. 

If the parametrizable architecture template is also used to model architectures 
for the needs of mapping (CAD) tools (e.g. technology mapping, placement, routing), such 
tools can be made retargetable, which means that they can be deployed on various platforms. 

It is remarked that the idea of tuning reconfigurable logic to an application 
domain as such is known. The benefit of making reconfigurable logic less general-purpose 
has been recognized in the past, and various apphcation-domain-specific reconfigurable logic 
architectures have been proposed in academia, mostly for DSP type of applications. Also, the 
introduction of coarse-grain reconfigurable computing architectures (coarse-grain 
reconfigurable computing architectures are reconfigurable on the level of words instead of 
the level of bits as classical FPGAs) has been driven by the idea of the cost reduction in 
certain application areas. Examples of such architectures include: the RAA architecture of 
Hewlett-Packard and the XPP processor from PACT. Yet another concept of application- 
domain-specific reconfigurable computing has been proposed as a part of the Totem project 
at the University of Washington (Totem: Custom Reconfigurable Array Generation', 
Compton & Hauck, Proceedings of IEEE Symposium on FPGAs for Custom Computing 
Machines, April 2001), where a software package enabling an automatic creation of coarse- 
grain custom reconfigurable logic architectures, by using a predefined architecture template 
and a set of a priori known algorithms, has been developed. By a considerable reduction in 
flexibility, the Totem architectures are able to achieve the cost level which is closer to the 
cost of ASIC's rather man to the cost of FPGA's. 

It is also remarked that the concept of a parametrisable reconfigurable logic 
architecture has been used in the past In •Architecture and CAD for Deep-Submicron 
FPGAs', Kluwer Academic Publishers, 1999, Betz et al. use a parametrizable description to 
model different variants of FPGA architectures for the purpose of a flexible CAD toolset 
Such a toolset which includes a placement and routing tool called VPR (Versatile Placement 
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and Routing) as well as a packing (clustering) tool called T-VPack (Timing-driven Packing 
for VPR), can be used as a part of the mapping flow targeting any LUT-based FPGA 
architecture. The architecture model used by Bete introduces some limitations, because of 
which only relatively simple FPGA structures can be modeled The details of the Beta's 
architecture model, with a special emphasis on the automation of the architecture generation 
process from a high level description, are discussed in the referenced document written by 
Bete et al. 

However, the following aspects make the concept according to the invention 

significantly different from the concepts already known. 

Firstly, unlike application-oriented architectures from academia which have 
only been optimized towards a single application domain, the concept according to the 
invention uses a complete approach by taking into account requirements of different 
application domains. Secondly, the concept according to the invention assumes that si 
type of processing kernels maybe shared across different apphcation domains. This means 
that for certain apphcation domains that, based on their similarities, can be classified as an 
apphcation class, only one type of architecture is required. This is essential since often the 
support of very many different flavors of reconfigurabie logic architectures may be 
economically unjustified Thirdly, the invention aims at a much higher level of flexibility 
than the one offered, for example, by the architectures proposed in the Totem project ; the 
Totem architectures are optimized towards a limited set of well-defined kernels only. On the 
one hand, this increases the cost penalty, on the other hand, it lowers the risk since the 
mapped kernels can still be updated or replaced with new ones after a reconfigurabie 

architecture is implemented in silicon. 

Also, the Betes model of a reconfigurabie architecture differs significantly 
from the template of a reconfigurabie logic architecture according to the invention. Firstly, 
the main purpose of the Betz's model is achieving flexibflity in the generation of routing 
architectures for a mapping tool. As a consequence, the information about the logic block in 
such a model is reduced to very few parameters mat are essential for the proper functioning 
of the tool. In principle, only the routing architecture can be generated, while logic blocks are 
modeled as black boxes of the specified granularity. In contrast, the template according to the 
invention defines a complete architecture of a reconfigurabie logic device, that is, all 
functional blocks (logic and input/output blocks) and the associated routing resources. 
Furthermore, the template according to the invention can be applied both to a mapping CAD 
flow and a physical design flow (e.g. layout generation). Secondly, the Betz's model targets 
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6 18.12.2003 
conventional general-purpose FPGA architectures. It assumes a simple k-input LUT as a 
basic logic element of such architectures; the LUTs can be clustered together forming a 
coarser logic block. This is in contrast to the template according to the invention, which is 
meant for the modeling of application-domain oriented architectures. Thus, the values of the 
template parameters depend on the target application domain. Besides, basic logic elements 
in our model can be much more complex than a single k-LUT element as assumed in T- 
VPack and VPR. Thirdly, the Betz's architecture model is based on four levels of hierarchy, 
while our architecture template features five levels; the additional level of hierarchy in our 

model allows an unambiguous description of functionally different reconfigurable logic 
structures. 

A further remark is that not only the above-mentioned differences with respect 
to already known approaches make the concept according to the invention particularly 
advantageous. Another important distinctive feature is the combination of the concept of the 
apphcation-domato-specialization of reconfigurable logic architectures with the concept of 
their automatic generation (derivation) from a generic architecture template. This 
combination defines the complete methodology, as will be appreciated by a person skilled in 
the art. 



It is noted that US 6,476,636 discloses an architecture of specific commercial 
eFPGA (Actel Corporation). The complete device is assembled from tiles, which are strictly 
defined. The document does not address the problem of asymmetry of the routing 
architecture. 

Finally, it is noted that US 6,301,696 discloses a methodology for creating so- 
called •hardened' FPGA's. 'Hardening' means bypassing on-state switches of the 
programmed FPGAs with metal connections, which leads to a performance improvement 
The silicon area of final FPGA is, however, the same as a classical FPGA. The term 
'template' is used to describe an uncommitted (un-configured) FPGA device. 

An embodiment of the method according to the invention is defined in 
claim 2. In this embodiment the template comprises an array, the array comprising a plurality 
of logic tiles, and the number of logic tiles being a first parameter. A further embodiment is 
defined in claim 3, wherein the aspect ratio of the array is a second parameter. 

Claim 4 defines a further embodiment of the template according to the 
invention. In tins embodiment, the template further comprises: 

at least one simple input/output tile, the simple input/output tile being coupled 
to a first logic tile; 
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at least one input/output tile with routing functionality, the input/output tile 
with routing functionality being coupled to a second logic tile; 

a coiner routing tile, the comer routing tile being coupled to at least two 

input/oulput tiles. 

5 Claim 5 defines an embodiment of the logic tiles according to the invention. In 

this embodiment, at least one of the logic tiles comprises: 

a logic block, the logic block comprising a plurality of logic block ports; 
routing resources, the routing resources comprising: 

- a plurality of routing tracks; 

10 . logic ports, the logic ports being arranged to couple the logic block ports to a 

neighboring logic tile; 

- routing ports, the routing ports being arranged to couple Ihe routing tracks to 

a neighboring logic tile; 

- direct ports, the directs ports enabling a direct connection of the logic block 

15 with neighboring logic tiles. 

Claim 6 defines an embodiment of the logic block according the invention. In 

this embodiment, the logic block comprises: 

a plurality of processing clusters, the number of processing cluster being a 

third parameter, wherein at least one of the processing clusters comprises a plurality of 
20 serially connected processing elements, the number of processing elements being a fourth 

parameter, and the processing cluster farther comprising a plurality of first secondary input 

ports, a first carry input port and a first carry output port; 

a first multiplexer block, the first multiplexer block being arranged to be 

controlled by control signals issued by a first input selection block, the first multiplexer block 
25 being arranged to make a selection from first intermediate signals issued by the processing 

elements; 

an output selection block, the output selection block being arranged to receive 
the selection of the first intermediate signals and to determine the number of output signals of 
the logic block, die output selection block farther being arranged to generate the output 
30 signals and to send the output signals to output ports of the logic block; 

a flip-flop block, the flip-flop block being arranged to register the output 

signals. 

Claim 7 defines a further embodiment of die logic block according to the 
invention, wherein the first input selection block is arranged to couple the first primary input 
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ports to second primary input ports, the second primary input ports being comprised in the 
processing elements, and to select input signals; the first input selection block further being 
arranged to accept output signals of the logic block as input signals such that a feedback loop 
is realized. 

Claim 8 defines an embodiment of the processing elements according to the 
invention. In this embodiment, at least one of the processing elements comprises: 

a plurality of serially connected logic elements, the number of logic elements 
being a fifth parameter; 

the second primary input ports; 

a plurality of second secondary input ports, the second secondary input ports 
being coupled to third secondary input ports comprised in the logic elements; 

a second carry input port, the second carry input port being coupled to a third 
carry input port comprised in a first one of the serially connected logic elements; 

a second carry output port, the second carry output port being coupled to a 
third carry output port comprised in a last one of the serially connected logic elements; 

a plurality of first arithmetic output ports; 

a first Boolean output port; 

a second input selection block, the second input selection block being arranged 
to couple the second primary input ports to third primary input ports comprised in the logic 
elements, and to select input signals; 

a second multiplexer block, the second multiplexer block being arranged to be 
controlled by control signals issued by the second input selection block, the second 
multiplexer block being arranged to select signals originating from second Boolean output 
ports comprised in the logic elements, and the second multiplexer block further being 
arranged to produce an output signal for the first Boolean output port; 

wherein second arithmetic output ports comprised in the logic elements are 
coupled to the first arithmetic output ports. 

Claim 9 defines an embodiment of the logic elements according to toe 
invention. In this embodiment, at least one of the logic elements comprises: 

a plurality of third primary input ports, the number of third primary input ports 
being a sixth parameter; 

the third carry input port or a further carry input port; 
the third carry output port or a further carry output port; 
one of the second Boolean output ports; 
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a plurality of foe second arithmetic output ports, foe number of second 

arithmetic output ports being a seventh parameter. 

Claim 10 defines a reconfigurable logic core having an architecture created by 
a method according to Ihe invention. The methods according to me invention are particularly 
advantageous for creating architectures for such a reconfigurable logic core. These 
architectures can be generated automatically. 



The present invention is described in more detail with reference to the 

drawings, in which: 

Fig. 1 illustrates a logic element which can be used as a building block of a 

template according to the invention; 

Fig. 2 illustrates examples of domain-specific logic elements; 

Fig. 3 illustrates the number of ports of the logic elements as illustrated in 

Fig. 2; 

Fig. 4 illustrates the functionality of the logic elements as illustrated in Fig. 2; 
Fig. 5 illustrates a processing element comprising a plurality of logic elements 

according to the invention; 

Fig. 6 illustrates the number of input and oulput ports of the processing 
element as iUustrated in Fig. 5, dependent on the type of the logic elements used as its basic 
components; 

Fig. 7 describes the functionality of processing elements built of logic 

elements of various types; 

Fig. 8 illustrates a logic block comprising clusters of processing elements 

according to the invention; 

Fig. 9(a) and Fig. 9(b) illustrate input selection blocks with one-to-one 

feedback connections and full feedback connections; 

Fig. 10 illustrates the number of foe primary input and output ports of foe logic 
block as iUustrated in Fig. 8, dependent on foe type of foe logic element 

Fig. 1 1 illustrates foe granularity of foe largest Boolean, arithmetic and 
memory functions that can be implemented in foe logic block as iUustrated in Fig. 8, 
dependent on foe type of foe logic element; 

Fig. 12 iUustrates a logic tile comprising a logic block according to foe 

invention; 
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Fig. 13(a) illustrates an example of me connectivity between selected ports of 
a logic block, direct ports, and routing tracks of a horizontal routing channel; 

Fig. 13(b) illustrates the connectivity matrices corresponding to the example as 
illustrated in Fig. 13(a); 

Fig. 13(c) illustrates a possible implementation of the connection blocks; 

Fig. 14(a) illustrates two different types of segment connection patterns; 

Fig. 14(b) illustrates three types of programmable switches; 

Fig. 1 5 illustrates an example of a routing architecture with a routing channel 
consisting of three tracks with length- 1 wire segments and eight tracks with length-4 wire 
segments; 



invention; 



Fig. 1 6 illustrates an array comprising logic tiles LT according to the 



Fig. 17 and Fig. 18 illustrate examples of architectures of auxiliary tiles with 
routing and of simple auxiliary tiles; 

Fig. 19 shows an example of an architecture instance of a data-path oriented 
FPGA logic block. 



25 



The architecture template according to the invention defines a way of 
20 generating a complete architecture of any type of application- domain oriented reconfigurable 
logic core (of a stand-alone or embedded FPGA) using a limited number of basic building 
blocks called tiles. It is assumed that the generated architecture is homogeneous and 
hierarchical. In a preferred embodiment of the architecture template which is described 
below, the levels of hierarchy (in rising order) define the following modules: a logic element, 
a processing element, a logic block, a logic tile, and an array of a reconfigurable logic core. 

Fig. 1 illustrates a logic element LB which can be used as a building block of a 
template according to the invention. A logic element LE is a basic Look-Up Table based 
(LUT-based) functional component of a reconfigurable logic architecture. The type TYPE of 
the logic element depends on the type of application domain (an application class). The logic 
30 element LE has the set P = {p i: 0 < i < |P|} of primary input ports, the set S = { Si : 0 < i < 

|S|} of secondary input ports, and a carry input port ci. It also has Ihe set A = {ai: 0 < i < |A|} 
of arithmetic output ports, a Boolean output port b, and a carry output port co. The number of 
ports of the logic element LE and its functionality depend on the type TYPE of the logic 
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element The type TYPE depends on the application domain for which the reconfigurable 

logic core will be used. 

Three examples of domain-specific logic elements are shown in Fig. 2. 

The number of ports and functionality of the logic elements are given in Fig. 3 

and Fig. 4, respectively. The functionality is described as foe granularity of basic Boolean, 

arithmetic and memory functions mat can be implemented in the logic element In that sense, 

the granularity is defined as the number of bits of an input vector of the maximal Boolean 

function, the number of bits of a single operand of an arithmetic function, and the number of 

bits of data input of a memory. 

Fig. 5 illustrates a processing element comprising a plurality of logic elements 

le,, 162 up to and including le^, according to the invention. The processing element 
comprises the setN - {leu 0 < i < (N|> of seriall y connected logic elements. |N| determines 
the maximal granularity (in terms of the number of bits of the input vector) of a fully 
specified Boolean function which can be implemented in the processing element The 
processing element has the set X = 0 < i < |X|} of primary input ports, the set S - 0 < 
i < |S|} of secondary input ports, and a carry input port ci. It also has the set Y = {y,: 0 < i < 
[Y|} of output ports, a Boolean output port z, and a carry output port co. 

The input ports Xi of the processing element are connected via the input 
selection block to the primary input ports p t of the |N| successive logic elements. The input 
selection block, which comprises a set of multiplexers, guarantees that dependent on the 
functional mode of the processing element the primary input ports pi of the logic elements 
always receive the correct set of signals from the primary input ports * of the processing 
element The number [X] of primary input ports of the processing element is equal to the 
cumulative number of 1 -bit inputs of the largest Boolean, arilhmetic or memory function 
(whichever is greater) that can be implemented in the processing element The |S| secondary 
input ports si of the processing element are connected directly to the secondary input ports Si 
of all i 0 gi c elements. In contrast the carry input ports ci and carry outout ports co of logic 
elements are chained together. This means that all logic elements except the first one have 
their carry input ports ci connected to the cany output port co of the preceding logic element 
The first logic element of the processing element mat is leo, has its carry input port ci 
connected to the carry input port ci of the processing element; similarly, the last logic 
element of the processing element that is lepq has its carry output port co connected to the 
carry output port co of the processing element The arithmetic output ports ai of the logic 
elements are connected directly with the |Y| output ports yi of the processing element. The 
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Boolean output ports b of the logic elements are multiplexed in the multiplexer block 
comprising a /og|N]-level network of 2:1 multiplexers. The multiplexers are controlled by the 
set U = {ui: 0 < i < [U|} of control signals which are issued by the input selection block. The 
output of the multiplexer block, which is the output of the final 2:1 multiplexer in this block, 
connects to the Boolean output z of the processing element 

The number of input and output ports of the processing element, dependent on 
the type TYPE of the logic elements used as its basic components, is given in Fig. 6. Fig. 7 
describes the functionality of the processing elements built of logic elements of various types 
TYPE. 

Fig. 8 illustrates a logic block comprising clusters of processing elements pe,, 
pea up to and including pe^, according to the invention. A logic block comprises the set M = 
ipa: 0 < i < |M|} of processing elements, which are organized in |KJ parallel clusters of 
serially connected processing elements. The number of processing elements in a cluster 
depends for example on the word-size used in certain applications. Each cluster is 
characterized by an independent set of secondary input ports % and independent carry input 
ports cij and carry output ports co ; . The output signals of the logic block can be registered, 
which means that they can be synchronized with a clock signal. The output signals can also 
be fed to the inputs of the logic block allowing the realization of more complex logic 
functions or functions with feedback loops. It is noted that input pins, such as the secondary 
input ports tj and the carry input port cij, can sometimes be shared or merged because they are 
used exclusively. 

The logic block has the set I = ft: 0 < i < |I|} of primary input ports, and |0| 
feedback ports that are connected to the ports in the output port set O = {<*: 0 < i < |OI} of the 
logic block. The logic block also has the set T = ft: 0 < i < |T| a |T| = |S||K|} of secondary 
input ports. A first |S| inputs of the set T, that is t,, . .., belong to Ihe first cluster of 
processing elements, a second |S| inputs of the set T, that is tfrti, t 2 . |s| , belong to the 
second cluster of processing elements, etc. The logic block has also |K| carry input ports ck 
and |K| carry output ports co,, wherein T is the cluster index such that 0 < i < |KJ. 

The |I| primary inputs and |0| feedback inputs are fed to the input selection 
block comprising a set of multiplexers. The input selection block of the logic block serves 
two purposes. Firstly, if the number of primary input ports of the logic block is lower than the 
number of primary input ports of the processing elements of all clusters, that is if |I| < |M||X|, 
the input selection block implements a full connectivity between primary inputs of Ihe logic 
block and the primary inputs of the processing elements. The full connectivity guarantees the 
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required level of (routing) flexibility (which is particularly essential for random logic 
functions) at a reduced implementation cost This is because the reduced number of input 
ports of the logic block yields the reduced amount of routing resource hardware. For 
architectures in which the number of primary input ports |X| of the processing element is 
determined by the number of bits k of the input vector of the largest Boolean (random logic) 
function that the processing element can implement (i.e. |X| = k), the following empirical 
formula can be used to determine the relationship between the number of primary inputs |X| 
of the processing element and the number of primary inputs |I| of the logic block comprising 

|M| processing elements: |I| = |X|/2 (|M| + 1). 

Secondly, the input selection block allows the realization of the feedback if the 
signals from the set O of the feedback (output) ports of the logic block are selected as the 
inputs of the processing elements. Dependent on the target application domain, the input 
selection block of the logic block can be designed with either one-to-one feedback 
connections or full feedback connections. The one-to-one feedback connections are typical 
for data-path-dominated architectures, and allow realization of sequential arithmetic modules 
such as counters, incrementers, and decrementers, in which one of the arguments receives the 
registered signal from the output For that reason, the one-to-one feedback connections 
connect the |0| output ports of the logic block to the |M|- |X| primary input ports of all 
processing elements, such that the output port o, of the logic block, associated with the i-th 
bit of the arithmetic output is connected to the primary input of the processing element that 
is associated with the i-th bit of the first arithmetic argument 

In contrast the full feedback connections connect all |0| output ports of the 
logic block to all |M|-|X| primary input ports of the processing elements. This type of 
connections is typical for random-logic-oriented architectures, and it allows implementation 
of complex Boolean functions (then the feedback signals are not registered), or different 
types of finite state machines (then the feedback signals are registered). The input selection 
blocks with one-to-one feedback connections and full feedback connections are illustrated in 

Fig. 9(a) and Fig. 9(b), respectively. 

In Fig. 8, the outputs of the input selection block are connected to the primary 
input ports in the sets X of successive processing elements. The first |S| secondary input ports 
in the set T of the logic block are connected to the secondary input ports in the set S of all 
processing elements of the first cluster. In contrast the i-th carry input port cii of the logic 
block is connected via a 2:1 multiplexer to the carry input port ci of only the first processing 
element of the i-th cluster. The remaining processing elements of that cluster have their carry 
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input ports and cany output ports connected serially. The cany output port co of the last 
processing element within the i-th cluster is connected to the i-th carry output coi of the logic 
block. To enable a serial connection of clusters, the 2:1 multiplexer at the carry input port of 
the first processing element in the i-th cluster (except the first cluster) allows the selection 
between the signal from the carry input port cii of the logic block and the signal from the 
cany output port co of the i-th cluster. 

The |S| secondary input ports of the processing elements belonging to the i-th 
cluster receive signals from the i-th set of secondary input ports of the logic block, that is 
from ports tu-D-pi+i, . . ., Furthermore, the cany input port of the first processing element 
of the i-th cluster receives a signal from the i-th cany mput port cii of the logic block. The 
remaining processing elements of tbe i-th cluster have their carry input ports and carry output 
ports connected serially. The carry output port co of the last processing element within the i- 
th cluster is connected to the i-th cany output port coi of Ihe logic block. 

The multiplexer block of the logic block is a fog-|M|-stage network of 2: 1 
multiplexers which are controlled by the control signals from the set W = {wj: 0 < i < | W|} 
originating from the input selection stage. The multiplexers of the first stage select between 
signals from the Boolean output ports z of successive pairs of processing elements. Each 
multiplexer of the second stage selects between a pair of signals coming from the outputs of 
successive multiplexers of the first stage; each multiplexer of the third stage selects between 
a pair of signals coming from the outputs of successive multiplexers of the second stage, etc. 
The output signals of multiplexers in all stages are directed to output ports of the multiplexer 
block. This is in contrast to the multiplexer block of the processing element, in which the 
output signal of only the final multiplexer (i.e. in the last stage) is directed to an output port 
of the multiplexer block. 

The signals from the output ports of the multiplexer block and signals from the 
first |Y| output ports of all processing elements are connected to the inputs of the output 
selection block. The output selection block is a multiplexer network which determines Ihe 
final number of output signals of the logic block as well as the ports on which these signals 
appear. It is assumed that all output signals of the multiplexer block and all first |Y| signals of 
the processing elements can be chosen as logic block outputs. The signals from ihe output 
selection block are directed to the flip-flop block The flip-flop block allows any output of the 
logic block to be registered. The output signals of the flip-flop block, registered or not, are 
directed to the |0| output ports of the logic block. 
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Fig. 10 illustrates the number of the primary input and output ports of me logic 
block dependent on the type TYPE of the logic element Fig. 1 1 illustrates the granularity of 
the largest Boolean, arithmetic and memory functions that can be implemented in the logic 
block dependent on the type TYPE of the logic element 

Fig. 12 illustrates a logic tile comprising a logic block LB according to the 
invention. The logic tile is a main building block of a reconfigurable logic architecture. It 
comprises a logic block LB and routing resources of the logic block LB. The routing 
resources define the number of routing tracks in the horizontal and vertical routing channels, 
their segmentation, and the way how routing tracks connect to the ports (pins) of the logic 
block. The routing resources also define the types of programmable switches that link the 

routing wire segments together. 

The logic tile has three different types of ports: logic ports L L (left), Lr (right), 
Lt (top) and Lb (bottom), routing ports Rhl (horizontal left), Rhr (horizontal right), Rvr 
(vertical top), Rvb (vertical bottom), and direct ports Di (inputs) and Do (outputs). The logic 
ports are used to connect the ports of the logic block to the routing tracks of neighboring 
tiles; the routing ports are the end terminals of the routing tracks in the logic tile and are used 
to connect to routing channels of neighboring tiles; the direct ports enable a direct connection 
to neighboring logic tiles, that is without passing programmable switches. 

L in Fig. 12 denotes the set of all logic block ports of the logic block LB, 
which includes the sets of the primary input ports I, secondary input ports T, and carry input 
ports d, as well as the sets of output ports O and carry output ports Co, that is L = I u T u d 

uOuCo. 

The logic block ports in the set L of the logic block LB are connected to the 
ports in the sets L L and L T of the logic tile. The ports in the set L L connect to the routing 
tracks of the neighboring logic tile on the left via the ports in the set Lr of the left 
neighboring logic tile; the ports in the set Lr connect to the routing tracks of the neighboring 
logic tile on the top via the ports in the set L B of the top neighboring logic tile. The ports in 
the set L of the logic block LB also connect to the routing tracks within the logic tile. The 
connections of the logic block ports in the set L to the routing tracks of the logic tile are 

realized in so-called connection blocks. 

The connectivity in the connection blocks is described using a connectivity 
matrix. The rows of the connectivity matrix are elements of the routing port sets, while the 
columns are elements of the logic block port sets. The connectivity matrix is filled with 
values '0' and T. The value '1 ' at the (ij) position in the matrix means that a connection is 
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present between an i-th routing track and a j-th logic block port, while the value *0' means 
that no connection is present The connection blocks of the logic tile and thus their 
corresponding connectivity matrices, are described by functions a T , a B , a L and a R , such that: 

- o T : (Rhl x L B ) -> {0,1}; 
5 -ob:(RhlxL) {0,1}; 
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- at: (Rvr x L R ) -> {0,1}; 
-oriCRvtxL) -> {0,1}. 

It is noted that these matrices can also be considered to be parameters of the 
template. The contents of the matrices can be generated automatically using an algorithm. 

The connectivity in direct connection blocks, that is between logic block ports 
and the direct ports of the logic tile, is defined in a similar way. In this case, the rows of the 
connectivity matrix are addressed by the elements of the direct port set Di or Do, and the 
columns by the elements of the logic block port set L. The direct connection block for inputs 
is described by the function p fc while the direct connection block for outputs by the function 
Po. It is noted that the connectivity matrix of the direct connection block for inputs has its last 
|0|+|C 0 | columns filled with values *0' (no connections to Ihe oulput ports of the logic block), 
whereas the connectivity matrix of the direct connection block for outputs has its first 
|I|+|T|+|Ci| columns filled with values *0' (no connections to the input ports of the logic 
block). The connectivity functions fa and 0 O that describe the filling of connectivity matrices 
for direct ports are defined as follows: 

-0i:(DixL)-> {0,1}; 

.po:(D 0 xL)-> {0,1}. 

The input and output ports of the logic block that connect to exactly the same 
set of routing tracks (via the logic ports of the logic tile) as well as to the same set of direct 
input and direct output ports of the logic tile, respectively, can be reduced to a single port 
only. This allows a reduction of the implementation cost of the routing architecture. 

In Fig. 13(a) an example of the connectivity between selected ports of the 
logic block, the direct ports, and the routing tracks of the horizontal routing channel is shown. 
Fig. 13(b) shows the corresponding connectivity matrices and Fig. 13(c) shows a possible 
implementation of the connection blocks. 

The segmentation (length) of the routing tracks (i.e. the number of logic 
blocks the routing tracks span before being separated by programmable switches), the switch 
block architecture (i.e. the way how routing tracks in horizontal and vertical routing channels 



PHNL03 1466EPP 



17 



18.12.2003 



connect together), and Ihe type of programmable switches are defined by the function such 
that X: (Rhl x Rvr) -» {0,a>i} . The function X describes the switching matrix. The rows of the 
switching matrix are elements from the routing port set Rhl, and the columns are the 
elements from the routing port set Rvr- The switching matrix is filled with value '0' or with 
elements <m from the set £2, such that £2 = {a* a* e N \ {0} a 1 <: i <, |£2|} wherein N is the set 
of natural numbers. The set £2 is the set of the switching point types. 

A switching point type is defined by the segment connection pattern and the 
type of programmable switch used to create the connection between routing track segments. 
The segment connection pattern defines the way of connecting a routing track segment to the 
horizontal and vertical track segments that correspond to it The programmable switch 
defines an implementation of a single connection between a pair of the routing track 
segments in the switching point The size of the set £2 is thus determined by the number of 
combinations of the segment connection patterns and programmable switch types, and 
elements oh of that set are numbered accordingly. For example, for two different types of Ihe 
segment connection patterns (e.g. 'disjoint' and 'half in Fig. 14(a)) and three types of 
programmable switches (e.g. a pass transistor switch, a dual-pass gate switch, and a bi- 
directional buffered switch in Fig. 14(b)), six different switching points ©i, . . ., <d 6 are 
possible. If two routing tracks that cross have no connection, the value '0' is placed in the 
corresponding position of the switching matrix. 

The horizontal and vertical tracks in the logic tile end with so-called wire 
twisters. Thanks to foe wire twisters, foe routing resources of each logic tile can be made 
identical. Consequently, only one logic tile type suffices to build a reconfigurable logic core, 
ratfaer than very many different ones. The wire twisters are needed if foe routing architecture 
includes routing segments which span more than one logic block LB (i.e. routing segments 
with a length greater than 'length-1 ')• hi that case, segments of equal length which span more 
than one logic block LB must be twisted (see Fig 15(b)). Furthermore, foe total number of 
tracks of a given length must always be a multiple of that track length. For example, foe 
acceptable numbers of routing tracks of foe length-4 are: 4, 8, 12, 16, etc. Wire twisting in 
horizontal and vertical routing channels is defined by functions Gh and 9 v , respectively, such 
that 

- Oh: (Rhl x Rhr) -» {0,1}; 

- 9 V : (Rvr x Rvb) -> {0,1}. 

The functions 6h and 9v define horizontal and vertical twist matrices. The 
rows of foe matrices are elements of foe routing ports sets on foe left and top of foe logic tile, 
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that is Rhl and Rvr, respectively. The columns of the matrices are elements of the routing 
ports sets on the right and bottom of me logic tile, that is Rhr and Rvb, respectively. The 
matrices are filled with values '0' and T. The value « 1 ' means that a connection is present 
between the routing tracks that are associated with those routing ports. The value '0' means 
that no connection is present Typically, the horizontal and vertical twist matrices are 
identical. 

Fig. 15 illustrates an example of a routing architecture with a routing channel 
consisting of three tracks with length- 1 wire segments and eight tracks with length-4 wire 
segments. Fig. 15(a) illustrates the architecture in a conceptual way. It is noted that the 
length-1 wire segments use connection switches type 1 (e.g. a 'disjoint' segment connection 
pattern and pass-transistor-based switch), whereas the length-4 wire segments use connection 
switches type 2 (e.g. a 'disjoint' segment connection pattern and a buffer-based switch). In 
Fig. 15(b) an implementation of such an architecture is shown. The wire segments of the 
length greater than length-1 are twisted according to a modulo-length scheme. Finally, Fig. 
15(c) describes a switching matrix of the logic tile, wherein values '1 * and '2' refer to the two 
different types of switching points. The twist matrix (horizontal and vertical) describes the 
twisting mechanism of the routing tracks in the logic tile. 

Fig. 16 illustrates an array comprising logic tiles LT according to the 
invention. The top level of a reconfigurable logic architecture according to the invention is an 
array of logic tiles LT. The number of logic tiles LT comprised in the array and the aspect 
ratio of the array are parameters of the template. The logic tiles LT are surrounded by 
auxiliary tiles CRT, IORT, IOT which have a twofold function. Firstly, they act an interface 
between a reconfigurable logic fabric and the other system resources that are embedded on 
the same piece of silicon. Secondly, they complete the routing architecture. The latter is 
required because the external routing channel created by the routing resources of the logic 
tiles LT on the edge of the array is present only at the bottom and right side of Ihe array. 
Therefore, input/output tiles with routing IORT are placed on the left side and the topside of 
the array. Simple input/output tiles IOT are placed at the right and bottom side of the array. 
Additionally, a comer routing tile CRT that closes the external routing channel is placed at 
the left top comer of the array. The bold ring in Fig. 16 shows a resultant routing channel 
created in this manner. 

The logic tiles LT are abutted via their routing ports. This means that the ports 
in the horizontal left Rhl connect to the ports in the horizontal right set Rhr of a neighboring 
logic tile. Similarly, the ports in the vertical top set Rvr connect to the ports in the vertical 
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bottom set Rvb of a neighboring logic tile. The connections to the routing tracks of 
neighboring logic tiles on the left and top are implemented via pairs of ports from the set of 

ports Ll-Lr and Lr-L B , respectively. 

Examples of architectures of auxiliary tiles with routing CRT, IORT and of 
simple auxiliary tiles IOT are shown in Fig. 17 and Fig. 18. The elements of the auxiliary 
tiles CRT, IORT, IOT are defined analogously to the definition of elements of the logic tiles 
LT. The top input/output tile with routing IORT is illustrated in Fig. 17(a); it has two sets of 
input/output ports F T and G B , and three sets of routing ports, that is Rhl, Rhr and Rvb- The 
ports in the set F T connect to the system resources, while the ports in me set Gb enable the 
connection of the ports in the set Lr of a logic tile LT at the top of the array to the routing 
resources of the top inputfoutput tile with routing IORT. The routing ports in the sets Rhl and 
Rhr connect to the ports in the sets Rhr and Rhl of neighboring IORT tiles, respectively. The 
ports in the set Rvb connect to the ports in the set Rvr of a logic tile LT at the top of me 
array. The set E is the set of direct input and output ports of the tile and it connects to the 
direct input and direct output ports in the sets Di and Do of the logic tiles LT, respectively. 
The connectivity matrices yr, y B and 6V in Fig. 17(a) are defined as follows: 

- y T : (Rhl x Gb) -» {0,1}; 
-Yb:(RhlxF t )-^ {0,1}; 

-&t:<ExF t )-> {0,1}. 

The left input/output tile with routing IORT depicted in Fig. 17(b) comprises 
the same elements as the top input/output tile with routing IORT. However, the positions of 
these elements are mirrored with respect to the positions of elements in the top input/output 
tile with routing IORT. The left input/output tile with routing IORT has two sets of 
input/output ports F L and Gr, three sets of routing ports, that is Rvb, Rvr and Rhr, and the set 
of direct ports E. The ports in the set F L connect to the system resources, while the ports in 
the set Gr enable the connection of the ports in the set L L of a logic tile LT on the left edge of 
the array to the routing resources of the left input/output tile with routing IORT. The routing 
ports in the sets Rvb and Rvr connect to the ports in the sets Rvr and Rvb of neighboring 
IORT tiles, respectively. The ports in the set Rhr connect to the ports in the set Rhl of a logic 
tile LT at the left edge of the array. The connectivity matrices yt, Yr and 5l in Fig. 17(b) are 

defined as follows: 

- Yi; (Rvr x Gr) -> {0,1}; 

- Yr: (Rvr x Fl) {0,1}; 
-5l:(ExF l )-> {0,1}. 
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The corner routing tile CRT depicted in Fig. 17(c) has two sets of routing 
ports, that is Ryb and Rhr. The ports in die set Rvb connect to the ports in the set Rvr of the 
most top left input/output tile with routing IORT. The ports in the set Rhr connect to the 
ports in the set Rhl of the most left top input/output tile with routing IORT. 
5 The right input/output tile IOT depicted in Fig. 18(a) has two sets of 

input/output ports F R and G L , and the set of direct ports E. The ports in the set F R connect to 
the system resources, while the ports in the set Gx connect to the routing resources of logic 
tiles LT at the right edge of the array via the set L R of the logic tile ports. The connectivity 
matrix 5r for direct connections is defined as Sr: (E x Fr) — » {0, 1 } . 
10 1116 bottom input/output tile IOT depicted in Fig. 1 8(b) plays a similar role as 

the right input/output tile IOT, hut it is placed at the bottom of the reconfigurable logic core. 
The bottom input/output tile IOT has two sets of input/output ports F B and G T , and the set of 
direct ports E. The ports in the set F B connect to the system resources, while the ports in the 
set Gt connect to the routing resources of logic tiles LT at the bottom edge of the array via 
15 the set L B of the logic tile ports. The connectivity matrix 5b for direct connections is defined 
as5 B :(ExF B )^{0,l}. 

It is noted that the connectivity matrices X in each tile are defined identically. 
The correct functioning of the switch blocks in the logic tiles at the edge of the array and the 
input/output tiles with routing is guaranteed by the proper programming of the configuration 
memory of the reconfigurable logic core. This means, for example, that programmable 
switches of the right bottom logic tile are programmed such that no routing connection to the 
bottom and to the right of this tile is possible. 

Fig. 19 shows an example of an architecture instance of a data-path oriented 
EPGA logic block. The logic block structure has been derived from the above-described 
template setting the template parameters as follows: 

- logic element level: TYPE=data-path, |P|=2, |S|=3, |A|=1; 

- processing element level: |N]=4, |X|=8, |S|=3, |Y|==4; 

- logic block level: |M|=1 , |K|= 1 , |I|=8, |0|=4- 

The logic block of this type implements both data-path functions (up to 4-bits) 
and random logic function (up to 4 inputs). 

It is remarked that the scope of protection of the invention is not restricted to 
the embodiments described herein. Neither is the scope of protection of the invention 
restricted by the reference symbols in the claims. The word 'comprising' does not exclude 
other parts than those mentioned in a claim. The word 'a(n)' preceding an element does not 
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exclude a plurality of those elements. Means forming part of the invention may both be 

in the form of dedicated hardware or in the form of a programmed general- 
purpose processor. The invention resides in each new feature or combination of features. 
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CLAIMS: 



! a method for creating an architecture of a reconfigurable logic core on an 

integrated circuit, the architecture comprising logic components, routing components and 
interface components, characterized in that the architecture is derived from a template, the 
template being a model configured by a plurality of parameters, wherein the model defines 
5 the logic components, the routing components and the interface components, tne parameters 
having values and the values being in accordance with an application domain. 

2. A method as claimed in claim 1 , wherein the template comprises an array, the 
array comprising a plurality of logic tiles, and the number of logic tiles being a first 

10 parameter. 

3. A method as claimed in claim 2, the aspect ratio of the array being a second 
parameter. 

15 4 A method as claimed in claim 3, wherein the template farther comprises: 

at least one simple input/output tile, the simple inputfoutput tile being coupled 

to a first logic tile; 

at least one input/output tile with routing functionality, the input/output tile 
with routing functionality being coupled to a second logic tile; 
20 - a corner routing tile, the comer routing tile being coupled to at least two 

input/output tiles. 

5. A method as claimed in claim 4, wherein at least one of the logic tiles 

comprises: 

2 5 a logic block, the logic block comprising a plurality of logic block ports; 

routing resources, the routing resources comprising: 

- a plurality of routing tracks; 

- logic ports, the logic ports being arranged to couple the logic block ports to a 
neighboring logic tile; 
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- routing ports, the routing ports being arranged to couple the routing tracks to 
a neighboring logic tile; 

- direct ports, the directs ports enabling a direct connection of the logic block 
with neighboring logic tiles. 



10 



15 



20 



6. A method as claimed in claim 5, wherein me logic block ports comprise first 

primary input ports and the logic block further comprises: 

a plurality of processing clusters, the number of processing cluster being a 
third parameter, wherein at least one of the processing clusters comprises a plurality of 
serially connected processing elements, the number of processing elements being a fourth 
parameter, and the processing cluster further comprising a plurality of first secondary input 
ports, a first cany input port and a first carry output port; 

a first multiplexer block, the first multiplexer block being arranged to be 
controlled by control signals issued by a first input selection block, the first multiplexer block 
being arranged to make a selection from first intermediate signals issued by the processing 
elements; 

an output selection block, the output selection block being arranged to receive 
the selection of the first intermediate signals and to determine the number of output signals of 
the logic block, the output selection block further being arranged to generate the output 
signals and to send the output signals to output ports of the logic block; 

a flip-flop block, the flip-flop block being arranged to register the output 

signals. 



25 



7. A method as claimed in claim 6, wherein the first input selection block is 

arranged to couple the first primary input ports to second primary input ports, the second 
primary input ports being comprised in the processing elements, and to select input signals; 
the first input selection block further being arranged to accept output signals of the logic 
block as input signals such that a feedback loop is realized. 



30 8 - A method as claimed in claim 6, wherein at least one of the processing 

elements comprises: 

a plurality of serially connected logic elements, the number of logic elements 
being a fifth parameter; 

the second primary input ports; 
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a plurality of second secondary input ports, the second secondary input ports 
being coupled to third secondary input ports comprised in the logic elements; 

a second carry input port, the second carry input port being coupled to a third 
carry input port comprised in a first one of the serially connected logic elements; 

a second carry output port, the second carry output port being coupled to a 
third carry output port comprised in a last one of the serially connected logic elements; 

a plurality of first arithmetic output ports; 

a first Boolean output port; 

a second input selection block, the second input selection block being arranged 
to couple the second primary input ports to third primary input ports comprised in the logic 

elements, and to select input signals; 

a second multiplexer block, the second multiplexer block being arranged to be 

controlled by control signals issued by the second input selection block, the second 
multiplexer block being arranged to select signals originating from second Boolean output 
ports comprised in the logic elements, and the second multiplexer block further being 
arranged to produce an output signal for the first Boolean output port, 

wherein second arithmetic output ports comprised in the logic elements are 
coupled to the first arithmetic output ports. 



A method as claimed in claim 8, wherein at least one of the logic elements 



comprises: 



a plurality of third primary input ports, the number of third primary input ports 

being a sixth parameter; 

the third carry input port or a further carry input port; 
the third carry output port or a further carry output port; 
one of the second Boolean output ports; 

a plurality of the second arithmetic output ports, the number of second 
arithmetic output ports being a seventh parameter. 



10. A reconfigurable logic core having an architecture created by a method as 

claimed in any of the preceding claims. 
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ABSTRACT: 



A method is provided which creates an architecture of a reconfigurable logic 
core. The architecture can he deployed for various purposes and its implementation is cost- 
efficient in terms of area, performance and power. The invention relies on the perception that 
a template can be used to describe such an architecture. The architecture can then easily be 

5 created as an instance of the template. The template is a model which defines logic 

components, routing components and interface components of a reconfigurable logic core. 
For example, logic components may be logic elements, processing elements, logic blocks, 
logic tiles and arrays in a hierarchical order. Routing components may comprise routing 
channels comprising routing tracks which provide interconnection means between the logic 

10 components. Interfece components may be input and output ports. The model is configured 
by a number of parameters; the value of these parameters is in accordance with an application 

domain. 



Fig. 16 



PHNLQ31466 



1/15 




FIG.1 



PHNL031466 




*- 4wr5K'- **5ess£i 




o 

CM 

CD 





DM Ml n414&& 



3/15 



TYPE 


IPI 


IS! 


IAI 


Data-path oriented 


2 


3 


1 


Random-logic oriented 


4 


0 


1 


Memory-oriented 


6 


1 


2 



FIG. 3 



TYPE 

Data-path oriented 


Boolean 
2 


Arithmetic 
1 


Memory 


Random-logic oriented 


4 


1 




Memory -oriented 


5 


2 


2 



FIG.4 



TYPE 
Data-path oriented 
R andom-logic oriented 
Memory-oriented 



maxC/og pSTl + 4. 2- INI 
6-INI 



G.6 



TYPE 


Boolean 


Arithmetic 


Memory 


Data-path oriented 


fcwrlNI + 2 


INI 




Random-logic oriented 


foglNI + 4 


INI 




Memory-oriented 


/oglNI + 5 


2JNJ 


2-INI 



FIG.7 



PHNL031466 



m 



4/15 




FIG.5 



PHNL031466 



5/15 




FIG.8 



DUMI AO-f AC* 



6/15 




PHNL031466 

7/15 



TYPE 
Data-path oriented 



Random-lo gic oriented 
Memorv-oriented 



1 

IXl-iMI 
max(lXl/2-(|Mj + 1 

IXHMI 



IXI-IMl-2: 



FIG. 10 



I TYPE 


Boolean 


Arithmetic 


Memory 


Data-path oriented 


loB\M\-(lom\+2) 


IMI-iNl 




Random-loaic oriented 


los\M\- Gozn + 4) 


IMI-INI 




Memorv-oriented 


log\M\- (/oglNl + 5) 


2-IMl-lNl 


2-IMNNI 



FIG.11 



PKNL031468 

8/15 




FIG.12 



r\t in it r\r\ A A CO 

rniMLUOiHou 



9/15 




FIG.13C 



PHNL031466 



r 



10/15 




PHNL031466 



11/15 




FIG.15C 



PHNL031466 



12/15 






PHNL031466 



13/15 




FIG.17C 



PHNL031466 



14/15 




V 

« 



PHNL031466 



f 

15/15 




G.19 



CT/IB2004/052684 




This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record. 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

Jif^LACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

» 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




LINES OR MARKS ON ORIGINAL DOCUMENT 



REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 



